The LIA-EURECOM RT'09 Speaker Diarization System
Abstract
This paper presents LIA-EURECOM's joint submission to the NIST Rich Transcription 2009 (RT'09) speaker diarization evaluation. We describe a number of modifications to our previous system: beamforming for the multiple distant microphone (MDM) condition and significant enhancements to the speaker segmentation stage of the core speaker diarization system. These modifications improve both speech activity detection (MDM only) and overall diarization performance. We present experimental results on a development set of 23 shows and on the RT'07 dataset, which was used for validation. On the latter, the new system achieves a relative improvement in diarization error rate (DER) of 27% on the MDM condition. Similar experiments on the RT'09 dataset show a relative improvement in DER of 35%. Our results for the MDM condition compare reasonably well with those of others, even though we did not use any delay features other than for beamforming. Results for the single distant microphone (SDM) condition compare especially well with others' work and highlight the merit of our top-down, evolutive hidden Markov model (E-HMM) approach to speaker diarization.
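The relative DER improvements quoted in the abstract can be made concrete with a short sketch. It assumes the usual definition of DER as the sum of missed speech, false alarm speech and speaker error time, divided by the total scored speech time. The function names and all numeric values are hypothetical, chosen only to illustrate the arithmetic; they are not results from the paper.

```python
def der(missed: float, false_alarm: float, speaker_error: float,
        scored_time: float) -> float:
    """Diarization error rate as a fraction of scored speech time.

    All arguments are durations in seconds (assumed definition).
    """
    if scored_time <= 0:
        raise ValueError("scored_time must be positive")
    return (missed + false_alarm + speaker_error) / scored_time


def relative_improvement(baseline_der: float, new_der: float) -> float:
    """Relative reduction in DER going from a baseline to a new system."""
    if baseline_der <= 0:
        raise ValueError("baseline_der must be positive")
    return (baseline_der - new_der) / baseline_der


# Hypothetical example: a baseline DER of 20.0% reduced to 14.6%
# corresponds to a relative improvement of roughly 27%.
baseline = 0.200
improved = 0.146
print(f"relative improvement: {relative_improvement(baseline, improved):.1%}")
```

A relative figure depends on the baseline: the same absolute reduction in DER gives a larger relative improvement when the baseline error is lower, so the 27% and 35% figures cannot be compared without the underlying DER values.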
Similar resources
EURECOM submission to the Albayzin 2016 Speaker Diarization Evaluation
This paper describes the speaker diarization system submitted by EURECOM for the Albayzin 2016 speaker diarization evaluation. This evaluation consists of segmenting broadcast audio documents according to different speakers and attributing those segments to the speaker who uttered them, without any prior information about the speaker identities or their number. The EURECOM system is based on the b...
The LIA RT'07 Speaker Diarization System
This paper presents the LIA submission to the speaker diarization task of the 2007 NIST Rich Transcription (RT’07) evaluation campaign. We report a system optimised for conference meeting recordings and experiments on all three RT’07 subdomains and microphone conditions. Results show that, despite state-of-the-art performance for the single distant microphone (SDM) condition, in its current for...
ELISA NIST RT03 broadcast news speaker diarization experiments
This paper presents the ELISA consortium's activities in automatic speaker diarization (also known as speaker segmentation) during the NIST Rich Transcription (RT) 2003 evaluation. The experiments were carried out on real broadcast news data (HUB4) within the framework of the ELISA consortium. The paper first shows the value of segmentation into acoustic macro classes (like gender or bandwidth) as a...
NIST RT'05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings
This paper presents different pre-processing techniques, coupled with three speaker diarization systems, in the framework of the NIST 2005 Spring Rich Transcription campaign (RT'05S). The pre-processing techniques aim at providing a signal quality index in order to build a unique "virtual" signal from all the microphone recordings available for a meeting. The unique "virtual" signal relie...
Step-by-step and integrated approaches in broadcast news speaker diarization
This paper summarizes the collaboration of the LIA and CLIPS laboratories on speaker diarization of broadcast news during the spring NIST Rich Transcription 2003 evaluation campaign (NIST-RT 03S). The speaker diarization task consists of segmenting a conversation into homogeneous segments which are then grouped into speaker classes. Two approaches are described and compared for speaker diarizat...
Publication date: 2009